Automatic creation of hypertext networks from technical documents
نویسندگان
چکیده
This paper describes a practical method to automatically create hypertext networks from technical structured documents. The physical structure of the documents allows to cut them in nodes when the crossed references determines associative links. The differents steps of the processing line are presented.
منابع مشابه
Lightweight Databases
Current World Wide Web technologies concentrate on presenting documents to human readers. Although HTML identifies structures within a document, it does not allow the semantic content of document sections to be specified explicitly. We investigate a small extension to HTML which allows parts of a document to be mapped onto an underlying database schema. This allows automatic identification and ...
متن کاملAutomated Link Generation: Can we do Better than Term Repetition?
Most current automatic hypertext generation systems rely on term repetition to calculate the relatedness of two documents. There are well-recognized problems with such approaches, most notably, they are vulnerable to the linguistic effects of synonymy (many words for the same concept) and polysemy (many concepts for the same word). I propose a novel method for automatic hypertext generation tha...
متن کاملA text mining approach for automatic construction of hypertexts
The research on automatic hypertext construction emerges rapidly in the last decade because there exists a urgent need to translate the gigantic amount of legacy documents into web pages. Unlike traditional ‘flat’ texts, a hypertext contains a number of navigational hyperlinks that point to some related hypertexts or locations of the same hypertext. Traditionally, these hyperlinks were construc...
متن کاملSnitch: Augmenting Hypertext Documents with a Semantic Net
A new model of hypertext, in which text is augmented with a ne-grained semantic net representation of the text, solves several problems found in traditional hypertext models. In the new model, hypertext links are paths that originate in the text, move across to the semantic net, traverse a sub-path through the semantic net, then return to a diierent point in the text. Beneets of the model inclu...
متن کاملThe Role of Hypertext in CASE Environments
Software development involves a variety of interrelated software documents produced in the diierent phases of the software development process. A CASE environment should maintain these documents and their relationships, and provide support for their consistency. Hyper-text provides a general way to store and present interrelated information. It is natural to expect that hypertext can play a sig...
متن کامل